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A method and device for generating display frames from a sequence of source frames 
through synthesizing one or more intermediate frames exclusively from an immediately 
preceding source frame. 



BACKGROUND OF THE INVENTION 

The invention relates to a method as recited in the preamble of Claim 1 . 
US 4,736,248 discloses how to generate display frames by interpolating between source 
frame pairs. The transformation algorithm is derived from point pairs that occur in both of 
5 two successive source frames. The interpolation uses the same transformation for other pixel 
pairs that occur in both of these source frames. Sometimes a particular source pixel is 
present in only one of the two source frames, so that only for that particular source pixel ex- 
trapolation must be effected. The reference intends to improve picture rendering in dynamic 
aerial survey. 

10 A different field of use pertains to highly interactive computer games and 

similar multimedia environment types. Here, it is important to have a high frame rate, so 
that displayed motion will be as smooth as possible. In principle this will also allow minimal 
latency between user-initiated events and visual feedback connected therewith. Such is especi- 
ally important for navigation control, like in flight simulation games. It has been suggested to 

15 raise the frame rate to a value comparable to the display refresh rate that may be 60-72 Hz. 
It has furthermore been found that Virtual Reality (VR) systems need low latency to protect 
a user person against motion sickness. However, the general use of interpolating according 
to the reference will introduce additional latency, because the various interpolation 
parameters will only be known after reception of the later source frame, even if among all of 

20 the pixels, certain display frame pixels will only depend on past source frame pixels. 

SUMMARY TO THE INVENTION 

In consequence, amongst other things, it is an object of the present 
invention to avoid the latency increase caused by overall interpolation. Now therefore, 
25 according to one of its aspects, the invention is characterized according to the characterizing 
part of Claim 1 . The Z-buffer is generated during the rendering of the source frames that is 
not part of the invention. The Z-buffer may be used to convert the 2-D-frames into 3-D 
space, so that changes in perspective as well as arbitrary 3-D camera rotations and 
translations may be implemented (Eq.8). These two transform types are the main causes for 
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the changing of a scene. 

A specific problem caused by extrapolating is that a scene part which was 
obscured in an earlier source frame may subsequently be uncovered in an extrapolated fra- 
me, because the obscuring object has moved in a transverse manner with respect to the 
5 obscured part. A solution is attained through lateral extrapolation from adjacent pixels that 
had not been subject to obscuring in the preceding source frame. The extrapolation may go 
along with or counter to the scanning direction. If the extrapolating operates on a background 
pattern or another entity with coarse granularity, the result is usually true or nearly true. On 
the other hand, the effect of a small item that would suddenly be uncovered from behind an 

10 obscuring object, will be ignored until arrival of the next source frame. Usually, the effect of 
this faking is allowable. Contrariwise to the above, restricting the extrapolation to a 2D-affi- 
ne transform would often create unreasonable distortions in the extrapolated frames, because 
a hitherto obscuring but moved part would now be extended a great deal. This problem is 
solved by making the extrapolation specifically dependent on the depth (zv2) expressed in the 

15 view coordinates of Eq. 12. 

A display sequence may consist of the source frames together with 
intermediate frames. Alternatively, all or certain source frames may be left unused provided 
that the intermediate frames occur often enough. The inventive technology keeps display fra- 
me latency small. On the other hand, interpolating display frames may often increase latency 
20 to an unwanted extent. 

The invention also relates to a frame-based device arranged for practising 
the recited method. Further advantageous aspects of the invention are recited in dependent 
Claims. 

The invention allows to calculate pixel displacements for producing 
intermediate frames between the frames generated by the 3-D rendering pipeline itself. The 
displacements may be calculated incrementally during scan conversion of the source frames, 
through consistently using results attained for an immediately adjacent pixel along the scan 
line. This requires for each next pixel only a low number of arithmetical calculations on the 
basis of pixels treated earlier. 

The invention allows to use known camera motion with respect to a most 
recent source frame for producing instantaneous coherence between pixels of this source 
frame and pixels of an immediately following synthesized display frame according to Eq. 9. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and further aspects and advantages of the invention will be 
discussed more in detail hereinafter with reference to the disclosure of preferred 
embodiments, and in particular with reference to the appended Figures that show: 
5 Figures 1A-1C, three display frame producing methods; 

Figure 2, a perspective overview of the generating; 
Figure 3, relations between device and view coordinates; 
Figure 4, the same as seen in perspective; 
Figure 5, a device for implementing the invention; 
10 Table 1 gives various mathematical expressions, 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Figure 1A shows a First method for raising the display frame rate. The top 
row has a time-sequence of uniformly spaced source frames "a". Between each pair of con- 

15 tiguous source frames an extra frame "b" is interpolated, as indicated by arrows. By using as 
display frames both the original a frames as well as the interpolated b frames, the rate is 
doubled. The latency is appreciable, because a b frame cannot be generated before the next a 
frame will have become available. 

Figure IB shows a second method for raising the display frame rate. 

20 Between each pair of contiguous source frames an extra frame "c" is extrapolated being 

based exclusively on the last preceding a frame, which has been indicated by arrows. To use 
as display frames both the original a frames and the interpolated c frames will double the 
rate. Latency is less than in Figure 1A, because a c frame may be generated immediately 
when the preceding a frame has become available. 

25 Figure 1C shows a third method for raising the display frame rate. 

Between each pair of immediately contiguous source frames an extra frame "d" is extrapola- 
ted exclusively based on the last preceding a frame, as indicated by arrows. In contradis- 
tinction to Figure IB, the delay between a source frame and its extrapolated d frame is 
nonuniform. Using half of the original a frames as well as the interpolated d frames will 

30 raise the rate by a factor of 50%. Latency is less than in Figure 1A, because a d frame may 
be generated immediately when the preceding a frame has become available. 

Figure 2 shows a perspective overview of the generating environment as 
based on an XYZ-coordinate system. Actual camera position is in the origin with the axis of 



ftcmncinn cucct /r»m «- r»j » 



WO 99/51029 4 PCT7IB99/00486 

view in the negative Z-direction; the camera has an ^"-direction along the positive Y-axis. 

Figure 3 shows geometrical relations between device and view coordina- 
tes. The camera is again in the ORigin with its axis in the negative Z-direction. The 
horizontal axis in the Figure is the X^-axis, the Y^-axis is perpendicular to the plane of 
5 the drawing. The object OBJ has been shown projected for display on the screen at x- 
coordinate x,. As shown, the Figure has a perspective or central projection. Parallel pro- 
jection is a feasible alternative. 

Figure 4 shows this geometry in three-dimensional space. Far at left is the 
eye point of the camera position. The far clipping plane fcp at right and the near clipping 

10 plane ncp in the centre are perpendicular to the viewing axis, and together delimitate a 
viewing angle frustum. Similar clipping planes extend in the x- and y-directions. In the 
projection plane the signed distances top, bottom, left and right have been indicated. Fur- 
ther, the distances near and far between the origin and the projection plane and the object 
plane have been indicated. Also the height angle fi has been shown. 

15 Figure 5 shows a device for implementing the invention. The set-up is 

centred around communication bus 20 and main memory 30. The information to be 
processed is fetched from memory and sent to processing modules 22, 24, 26 that are con- 
nected in series to symbolize successive operations as will be listed in the tables hereinafter. 
The processing modules may be collectively mapped on shared hardware such as a CPU. 

20 After processing and possibly transient storage, the successive display frames attain 

subsystem 34 for display to a user. Overall control subsystem 28 is connected to the proces- 
sing modules via control lines 23, 25, 27, that may lead via bus 20. In this manner, the data 
may in the CPU be translated into an image. Control subsystem 28 may receive control and 
data signals from various sources, such as manual or other inputs 32 from an actual user, ex- 

25 ternal signals from other users in an interactive environment, or from cameras or farther sub- 
systems not shown for brevity. 

MATHEMATICAL FORMULATION OF THE PROCEDURES 

Generally, due to camera rotating, translating or zooming, the complete 
30 frame content may change with time. If the geometrical relations between the previous 
camera position and direction and the new camera position and direction are known, the 
coherence between the pixels of the old source frame and the new intermediate display frame 
to be extrapolated will also be known. Table 1 gives various mathematical expressions, 
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including relations between various coordinates, such as view space coordinates, projection 
coordinates, normalized device coordinates and screen-space coordinates. 

First, equation (1) is the perspective projection matrix. It defines the 
various quantities near, far, top, bottom, left and right plane positions of the viewing 
5 frustrum shown in Figure 4. Equations (2) define a Symmetrical Viewing Frustrum; an 

unsymmetrical frustrum may be used as well. Next, equations (3) define the use of the height 
angle fi and aspect ratio a to further specify the viewing frustrum. Equation (4) expresses the 
relations between the projection coordinates (postscripted p) and homogeneous view 
coordinates (postscripted v) via the projection matrix P. Other projections such as orthogonal 

10 are also feasible. Equation (5) gives normalized device coordinates (subscripted n). Finally, 
equation (6) gives the screen coordinates (postscripted s). Straightforward insertion derives 
the ultimate relations between screen coordinates and view coordinates as given in (7). Using 
{xvl, yvl, zvl} and {xsl, ysl} as the coordinates in the source frame, then {xvl, yvl} can 
be written according to equation (8). 

15 Matrix transformation of the view coordinates due to camera motion 

and/or object changes is given in (9). Equation (10) gives the coordinates in the extrapolated 
frame, so that the expressions for (xs2, ys2) can be straightforwardly found as given in (11). 
Further, (12) is the expression for zv2. Now, fz is linearly related to the screen coordinates 
xsl and ysl , and fz may be incrementally computed during scan conversion by adding a con- 

20 stant delta that is the time derivative of the movement. Hence calculating zv requires the 
following: 

- add delta to fz 

- multiply by zvl 

- add to m23 

25 Further, xs2 and ys2 can be written according to expression (13). Also, fx 

and fy are linearly related to the screen coordinates xsl and ysl, so that they may also be 
incrementally computed by adding a constant delta. Therefore, the incremental calculation of 
xs2 and ys2 requires the following, in addition to calculating zx2: 

- calculate zv2 reciprocal (i.e. divide 1 by zv2) 
30 - add delta to fx and add delta to fy 

- multiply both these results by zvl 

- add the two products to h*k*m03/2 and h*k*ml3/2 respectively 

- multiply both results by reciprocal zv2 
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- add to w/2 and h/2, respectively. 

The above adds up to eight additions, five multiplications and one 

division. 

Furthermore, many game applications will present one or more objects on 
the screen, which may observe a dynamic behaviour that is independent from camera move 
ment. This behaviour may be represented by a relative transform with respect to the camera 
for each such object that will usually be represented as one rigid body, or as a composite 
made up of various interconnected rigid bodies. Object movement can be realized by multi- 
plying the camera transformation matrix by the rigid-body transformation M obj for all 
pixels of the object in question. Thereupon, the displacement functions described earlier will 
calculate the motion vectors of the various pixels. Each pixel should have an identifier of the 
object to which it belongs, which identification should furthermore point to the final 
transformation matrix M=M cam .M obj of that object in the view-space of the most recently 
rendered source frame. This can be effected by introducing an extra description plane which 
contains object or matrix pointers. Usually the number of such rigid bodies in any scene will 
be less than 256, so one extra byte per pixel should be sufficient. The object plane can be 
efficiently compressed with runlength encoding RLE as usually the plane will contain large 
areas of uniform- valued pixels that belong to a single rigid body. 
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Define view coord inaes xv, vv. zv (0) 
Projection Matrix: 



n2 


0 


1+r 


0 






r-l 




0 


In 


b+t 
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t-b 


t-b 
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J+n 


2fti 






f-n 


~f-n 
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0 


-1 


0 



Symetrical Viewing Frustrum given by: 
b = -t; 

1 = -r (2 ) 

Use Height Angle fi and aspect ratio a to specify frustrum: k = 1/Tan[2 fi] 
a = w/h; 

t = n/k; (3) 
r = ta; 

Projection Coordinates: 

{xp, yp, zp, wp} = P . {xv, yv, zv, 1}; (4) 
Normalised Device Coordinates: 

{xn, yn} = {xp, yp} / wp; (5) 



Screen Coordinates: 

{xs, ys} = {I w (xn + 1), ih (yn + 1)}; 
which by substitution is equal to: 



(6) 



RECTIFIED SHEET (RULE 91) 



WO 99/51029 



PCT/1B99/00486 



h k xv h _ h k yv 
2 zv ' 2 2 zv 



(7) 



So, using {xvl, yvl, zvl} and {xsl, ysl} as the coordinates in the source frame, then {xvl, 
yvl} can be written as: 



{XV J, yvl} = [ (W - 2 h X k Sl) ZVl , iH - 2 ,f } ZVl } 

Matrix Transformation of the view coordinates due to camera and/or object changes: 



(8) 



M = 



mOO mOl m02 m03 

mlO mil ml 2 ml 3 

m20 mil m22 m23 

0 0 0 1 



(9) 



Coordinates in the intermediate frame: 

{xv2, yv2, zv2, wv2} = M . {xvl, yvl, zvl, 1}; 



(10) 



So: 

{»2..3W2}-|* 



h k xv2 h 
2 zv2 * 2 



h k yv2 
2 zv2 



} = 



(11) 



1 

— W 

2 



1- 



hk \m03 + m02zvl + m00(w - 2xsl)zvl 

hk 



w zv2 



m01(h-2ysl)zvl 1 
hk I 



I* 
2 



k\ml3+ml2zvl + 



wj70(w-2x57)zv/) mll(h-2ysl)zvl 



1 - 



/lit 



Zv2 



and zv2 is equal to: 

zv2 = m23 ♦ f W 22 + ^^-2»7) + ^27(^-2^7) 1 
[ TjA: M J 



(12) 



= m23 + fz zvl 



fz is linearly related to the screen coordinates xcl and ysl, so that fz can be incrementally 
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computed by adding a constant delta during scan conversion. Hence the incremental 
computation of zv2 requires: 

- add delta to fz 

- multiply by zvl 

- add to m23 

xs2 and ys2 can be rewritten as: 

f w _ hkm03/2+(hkm02 + m00(w-2xsI) + m01(h-2ysl))zvll2 

h _ hkml3!2 + {hkm!2 +mlO(w -2xsl) + mll{h-2ysl))zvll2 \ 
2 zvl J 

iw _ hkm03l2 + ficzvl h _ hkml3l2 + fyvzl \ 



Also fx and fy are linearly related to the screen coordinates xsl and ysl, such that they can 
be incrementally computed by adding a constant delta. So the incremental computation of xs2 
and ys2 requires (additionally to the compution of xv2): 

- calculate zv2 reciprocal (divide 1 by zv2) 

- add delta to fx and add delta to fy 

- multiply both results by zvl 

- add to h k m03/2 and h k ml 3/2 respectively 

- multiply both results by reciprocal zv2 

- add to w/2 and h/2 respectively. 




This sums up to: 1 div, 5 mults, and 8 adds! 



WO 99/51029 

CLAIMS: 



PCT/IB99/00486 



1- A method for displaying a first sequence of display frames, comprising 

the steps of: 

receiving a second sequence of source frames at a source frame rate, and 
synthesizing on the basis of said second sequence for a pair of immediately adjacent source 
5 frames one or more intermediate frames through using a geometrical transformation method, 

and being characterized in that each intermediate frame is based exclu- 
sively on an immediately preceding source frame, through extrapolation from camera 
transformations, and through on the basis of Z-buffer information, in-frame extrapolating 
uncovered deeper pixels from likewise deeper but at present continually visible such deeper 
10 pixels, 

for on the basis of a sequence of said intermediate frames, and as the case may 
be, also on the basis of said source frames selecting said display frames at a raised display 
frame rate with respect to said source frame rate. 

15 2. A method as claimed in Claim 1, whilst using known camera motion with 

respect to a most recent source frame for producing instantaneous coherence between pixels 
of the latter source frame and pixels of an immediately following synthesized display , frame. 



3. A method as claimed in Claim 1, and based on in-frame extrapolation that 

20 operates exclusively along with or counter to display lines scan motion. 

4 - A method as claimed in Claim 1, while furthermore displaying a solid 

body in an extrapolated frame according to a transformation matrix M = M cam *M obj . 

25 5. A method as claimed in Claim 1, whilst ignoring an uncovered item that 

relates to no continually visible pixel, until arrival of a next source frame that has such item 
as rendered. 

6 - A frame-based device arranged for displaying a first sequence of display 
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frames, and comprising: 

receiving means for receiving a second sequence of source frames at a source 
frame rate, and synthesizing means for synthesizing on the basis of said second sequence for 
a pair of immediately adjacent source frames one or more intermediate frames through using 
5 a geometrical transformation method, 

characterized in that said synthesizing means are arranged for basing each 
intermediate frame exclusively on an immediately preceding source frame, through 
extrapolation from camera transformations, and through on the basis of Z-buffer information, 
in-frame extrapolating uncovered deeper pixels from likewise deeper but at present continu- 
10 ally visible such deeper pixels, 

and outputting means fed by said synthesizing means for on the basis of a 
sequence of said intermediate frames, and as the case may be, also on the basis of said sour- 
ce frames outputting said display frames at a raised display frame rate with respect to said 
source frame rate. 

15 

7. A device as claimed in Claim 6, wherein said synthesizing means are 
arranged for receiving camera motion information with respect to a most recent source frame 
for producing instantaneous coherence between pixels of the latter source frame and pixels of 
an immediately following synthesized display frame. 

20 

8. A device as claimed in Claim 6, said synthesizing means having in- frame 
extrapolation means that operate exclusively along with or counter to display lines scan moti- 
on. 



25 9. A device as claimed in Claim 6, and having matrix multiplication means 

for displaying a solid body in an extrapolated frame according to a transformation matrix 

lot>r 



30 



10. A device as claimed in Claim 6, and having ignoring means for ignoring 

an uncovered item that relates to no continually visible pixel, until arrival of a next source 
frame that has such item as rendered. 
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